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Abstract 

The premier alternative to the dark matter paradigm is modified gravity. 
Following an introduction to the relevant phenomenology of galaxies, I re- 
view the MOND paradigm, an effective summary of the observations which 
any theory must reproduce. A simple nonlinear modified gravity theory does 
justice to MOND at the nonrelativistic level, but cannot be elevated to the 
relativistic level in a unique way. I go in detail into the covariant tensor- 
vector-theory (TeVe S) which not only recovers MOND but can also deal 
in detail with gravitational lensing and cosmology. Problems with MOND 
and TeVeS at the level of clusters of galaxies are given attention. I also 
summarize the status of TeVe S cosmology. 



1. Introduction 

A look at the other papers in this volume will show the present one to be 
singular. Dark matter is a prevalent paradigm. So why do we need to dis- 
cuss alternatives ? While observations seem to suggest that disk galaxies 
are embedded in giant halos of dark matter (DM), this is just an inference 
from accepted Newtonian gravitational theory. Thus if we are missing un- 
derstanding about gravity on galactic scales, the mentioned inference may 
be deeply flawed. And then we must remember that, aside for some reports 
which always seem to contradict established bounds, DM is not seen directly. 
Finally, were we to put all our hope on the DM paradigm, we would be ig- 
noring a great lesson from the history of science: accepted understanding of 
a phenomenon has usually come through confrontation of rather contrasting 
paradigms. 

To construct a competing paradigm to DM, it is best to bear in mind con- 
crete empirical facts. Newtonian gravity with the visible matter as source of 
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Fig. 1.1. Examples of rotation curves of nearby spiral galaxies from Sofue and Rubin 
(2001). These resulted from combining Doppler data from CO molecular lines for 
the central regions, optical lines for the disks, and HI 21 cm line for the outer 
gaseous regions. The galactocentric radius R is in kiloparsec (1 kpc 3000 light 
years). Graph reprinted, with permission, from the Annual Review of Astronomy 
and Astrophysics, Volume 31 (c)2001 by Annual Reviews www.annualreviews.org. 

the Poisson equation properly describes all observed systems from asteroid 
scale up to the scale of the globular clusters of stars (~ 10 5 stars bound 
together in a ball the size of a few tens of light years). But as we move 
up to galaxies, ours or external ones, troubles appear. In essence the way 
disk-like galaxies rotate is incompatible with the Newtonian gravitational 
force generated by only the visible stars, gas and dust. From the centrally 
concentrated light distribution of the typical disk galaxy we would expect 
a rotation linear velocity which first rises with galactocentric radius r and 
then drops asymptotically as r -1 / 2 . But as clear from Fig ll.H most disk 
galaxy rotation curves become flat and stay so to the outermost measured 
point, which generally lies well beyond the edge of the optical galaxy. And 
typically the mass lying within the last measured point of the rotation curve, 
as calculated a la Newton, is at least an order of magnitude larger than the 
baryonic mass actually seen. These are empirical facts begging explanation. 
Dark matter, if appropriately distributed in each case, can explain the shape 
and scale of rotation curves. But the required mass distributions are not al- 
ways reasonable from the point of view of galactogenesis. e.g., the predicted 
central cusps in density are not observed. 

Another fact to be explained is the Tully-Fisher law for disc galaxies. 
Originally discovered as a correlation between blue luminosity and the peak 
rotation velocity of a disk, it has metamorphosed into McGaugh's baryonic 
Tully-Fisher law: the total mass in visible stars and gas (baryonic mass) in 
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Fig. 1.2. The baryonic Tully-Fisher correlation from McGaugh (2005) for disk 
galaxies. Mb is the total baryonic (stars plus gas) mass; Vf is the asymptotic 
rotation velocity. For galaxies from Sanders and McGaugh (2002) (shaded circles) 
the mass in stars comes from a fit of the shape of the rotation curve with MOND. 
For the eight dwarf spirals (unshaded circles) the mass in stars (relatively small) 
is inferred directly from the luminosity. The solid line, with slope 4, is MOND's 
prediction. Graph reproduced by permission of the American Astronomical Society. 



a disc galaxy is accurately proportional to the fourth power of the asymptotic 
(terminal) rotational velocity of that galaxy. This law extends over six orders 
of magnitude in mass, and is a tight correlation as can be appreciated from 
Fig jl .21 In the low surface brightness galaxies (LSBs), those with low cen- 
tral brightness per unit area (typically below 21.65 magnitudes per square 
arcsec), the rotation curve it still on the rise at the last measured point. Yet 
LSBs, which can be luminous and massive or small dim galaxies, all fall on 
the same Tully-Fisher relation as more conventional disks (McGaugh 2005). 

Within the DM paradigm the Tully-Fisher law must arise from galaxy 
formation since it connects luminosity of baryonic matter with a dynamical 
property, rotation, which is seen as dominated by the DM halo. But it has 
not been easy to derive Tully-Fisher from any natural connection between 
the two components. And as R. H. Sanders has pointed out, the messiness of 
galaxy formation is hardly the natural backdrop for such a sharp correlation 
between galaxy properties. The sharpness needs a dynamical reason as 
opposed to an evolutionary one. 

Turn from galaxies to clusters of galaxies. In these systems, containing 
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sometimes hundreds of galaxies and much hot intergalactic gas, the New- 
tonian virial theorem can be used to estimate the cluster's mass from the 
velocities of galaxies in the cluster. The determined masses are very large 
compared to the mass seen directly as galaxies and hot gas. The mass dis- 
crepancy also shows up when the overall cluster mass is determined a la 
Newton from the assumption that the hot gas is in a hydrostatic state, or 
when gravitational lensing by a cluster is analyzed in the framework of gen- 
eral relativity (GR). The conventional solution is to assume that the typical 
cluster contains DM to the tune of about five times the visible mass. 

There are other aspects of the missing mass problem, but the above will 
furnish enough background for the ensuing discussion here. The questions 
before us are two. Are there other scenarios, apart from DM, which can 
account for all the mentioned facts. And can one single out a particular 
alternative scenario as especially promising, both in terms of physical basis, 
and explanatory success? 



The MOND scheme or paradigm of Milgrom(1983a,b,c) serves first and fore- 
most as an effective summary of much extragalactic phenomenology. Mil- 
grom introduced a preferred scale of acceleration ao ~ 10~ 10 ms~ 2 of the 
order of the centripetal accelerations of stars and gas clouds in the outskirts 
of disk galaxies. In terms of it MOND relates the acceleration a of a test 
particle to the ambient Newtonian gravitational field — V<J>7v generated by 
the baryonic mass density alone by 



Milgrom assumes that the positive smooth monotonic function jl approx- 
imately equals its argument when this is small compared to unity (deep 
MOND regime), but tends to unity when that argument is large compared 
to unity (Fig JOjl . 

The stars or gas clouds orbiting in the disk of a spiral galaxy of baryonic 
mass M at radius r from its center do so in nearly circular motion with 
velocity V(r). Obviously \a\ = V(r) 2 /r. Sufficiently outside the main mass 
distribution we may estimate |V<£jv| ~ GM/r 2 . And at sufficiently large 
r, \a\ will drop below ao and we should be able to approximate fx(x) ~ x. 
Putting all this together gives F(r) 4 /r 2 GMao/r 2 . It follows, first, that 
well outside the main mass distribution V(r) must become independent of 
r, that is, the rotation curve must flatten at some value Vf, in agreement 



2. The MOND scheme 
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Fig. 1.3. Two widespread choices for Milgrom's interpolation function: the "simple" 
function fi{x) — x/(l + x) (solid) and the "standard" function ji{x) — xj \J\ + x 2 
(dashed). The dotted line corresponds to strict Newtonian behavior, obtainable 
also as the limit a -> of MOND. 

with observations. Second, from the coefficients it follows that 



But this is just McGaugh's baryonic Tuny-Fisher law (Fig. [L2]). The MOND 
predicted proportionality coefficient (Gao) -1 agrees well with the measured 
coefficient of the baryonic Tully-Fisher law, namely 50 M km -4 s~ 4 . Thus, 
MOND's single formula unifies two central facts of spiral galaxy phenomenol- 



Of course a devil's advocate could claim that MOND works well for the 
above phenomena because the low argument form of fl(x) and the value of ao 
have been rigged to obtain these results: after all, both flat rotation curves 
and early forms of the Tully-Fisher law were already known before MOND 
was formulated. Note, however, that the supposed prearrangement need 
not guarantee that the parameter ao needed to recover the observed Tully- 
Fisher law should have anything to do with the detailed shape of rotation 
curves interior to the flat regions. Yet MOND is singularly successful in 
explaining the detailed shapes of rotation curves, as made clear by Fig ll.4i 
The observed points are radio determined velocities. The MOND predictions 
were made using the "simple" form of fi(x). Once the value of ao is adopted, 
the only parameter that need be adjusted to fit the velocities using the 
observed photometry is the stellar mass-to-luminosity ratio T of each disk. 
The required Ts turn out, in most cases, to be reasonable in view of stellar 
evolutionary models (Sanders and McGaugh 2002). So MOND is a successful 
and consistent one-parameter-per-galaxy paradigm. By contrast DM halo 
models usually require three fitting parameters, e.g. T, length scale, and 
central velocity dispersion, and yet they do not do a better job than MOND. 
For disk galaxies MOND is more economical, and more falsiflable, than the 
DM paradigm. 

One cannot emphasize enough that a priori the ao that enters in the Tully- 
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Fig. 1.4. MOND fits to the measured rotation curves of four disk galaxies from the 
Uppsala catalog modeled by the method of Sanders and Noordermeer (2007). Each 
solid curve is the fit (based on the simple function jx of Fig. 11.31) generated by the 
distribution of stars and neutral hydrogen. Long-dashed, dotted and short-dashed 
curves give the Newtonian rotation curves generated separately by the galaxy's 
bulge, disc and gas, respectively. Figure courtesy of R. H. Sanders. 



Fisher law is, in principle, a different parameter from that which determines 
the shape of rotation curves. The value ao = 1.2 • 10~ 10 ms~ 2 that needs 
to be adopted for good fits to over one hundred galaxies agrees very well 
with the ao determined from the baryonic Tully-Fisher law. It is only in 
the MOND paradigm that the two roles of ao have a common origin, and 
observations agree that there is only one ao- 

Yet a third role for ao results from Milg rom's observation that S m — 
clq/G sets a special scale for mass surface density. Wherever in a system 
the actual surface mass density drops below E m , Newtonian gravitational 
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behavior gives way to MOND dynamics (Milgrom 1983b). In galaxies like 
ours this occurs some way out in the disk; that is why the rotation curves 
may exhibit a brief drop (attempting to act a la Newton) before becoming 
flat asymptotically as befits MOND. Of course, if at the center of a disk the 
surface mass density is already below S m , then MOND holds sway over the 
whole disk. Thus Milgrom predicted that were disk galaxies to exist with 
surface mass densities everywhere below £ m , they should show especially 
large acceleration discrepancies coming from the fact that ft <C 1 everywhere 
in them. Likewise he noted that the shapes of the rotation curves should 
be independent of the precise way jl(x) changes from a linear function to 
a constant (unity) since the nonlinear range of \x does not come into play. 
A population of such galaxies became known in the late 1980's (Begeman 
et al. 1991); these are none other than the LSBs. Much subsequent work 
has confirmed both facets of Milgrom's prediction (McGaugh and de Blok 
1998). In particular, it is easy to understand why in these galaxies the 
rotation curves are still on the rise at the outermost observed point. 

Use of the MOND analog to the virial theorem (Milgrom 1994a) to esti- 
mate the masses of clusters softens, but does not fully resolve, the accelera- 
tion discrepancy. Clusters still seem to contain a factor of 2-3 more matter 
than is actually observed in all known forms (Sanders and McGaugh 2002). 
One should keep in mind that clusters may contain much invisible matter 
of rather prosaic nature, either baryons in a form which is hard to detect 
optically, or massive neutrinos (Sanders 2003, 2007; Pointecouteau 2006) 
so that, while troubling, the above lingering discrepancy should not stymie 
further investigation of MOND. 

Excellent reviews exist of other phenomenological successes of MOND 
(Sanders and McGaugh 2002, McGaugh 2006, Scarpa 2006, Milgrom 2008). 
So I forego further review of MOND's implications, and turn to ask the 
obvious question. 

3. Modified gravity theory for MOND 

There is no doubt that MOND is a useful paradigm for summarizing ex- 
tragalactic data. But what is the physical basis of its success? The most 
conservative answer is that MOND is merely a happy summary of how much 
DM there is in galaxies, and how it is spatially distributed in the various 
types of objects. Surprisingly, this minimal interpretation, which would 
surely excite no opposition from conventional astrophysicists, does not hold 
out well. First, such interpretation should explain how the scale oq enters 
into spiral galaxy properties through some regularity in the formation pro- 
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cess of DM halos. Yet detailed arguments have shown that the halos cannot 
have an intrinsic scale (Sanders and Begeman 1994). In addition, it is hard 
to see how such an interpretation could give an account of the multiple roles 
of ao alluded to above (Milgrom 2002) . 

Another interpretation of MOND (Milgrom 1983b, 1994b) is that it reflects 
modified inertia, e.g. Eq. (jl.ip represents a modification of the Newtonian 
law / = ma, whether the force / is purely gravitational or includes other 
forces. In evaluating this alternative it is important to realize that the 
MOND formula cannot be exact. For in a binary system with unequal 
masses, m\ and the time derivative of m\V\ + miv<i, as calculated 
from Eq. (jl.l|) . does not vanish in general since the inertial factors /x's will 
generally be unequal. In light of this failure of momentum conservation, 
Milgrom suggested that Eq. (jl.ip is meaningful only for test particle motion 
in a background gravity field. While this restriction is not onerous if all one 
wants is to analyze the rotation curve of a galaxy, it would put problems 
such as the dynamics of a binary galaxy system out of the reach of MOND. 

Even as a theory of test particle motion, the modified inertia interpreta- 
tion of Eq. (jl.ip is problematic. Milgrom (1994b) showed rigorously that 
no Lagrangian exists for it which (a) is Galilean invariant, (b) generates 
Newtonian dynamics for ao — > and (c) generates deep MOND dynamics 
when ao — ?• oo. (If one instead demands relativistic invariance, the desired 
Lagrangian is even more remote (Domokos 1996)). To break this impasse 
Milgrom suggested giving up the locality requirement and instead deriving 
the modified inertia from a nonlocal action functional, one which is not 
an integral over a Lagrangian density. More recently Milgrom (2006) has 
suggested, as an alternative route, to give up Galilean invariance in order 
to make a Lagrangian possible. Clearly the modified inertia viewpoint of 
MOND is a difficult one to formalize. 

When introducing MOND Milgrom also suggested that it may instead 
represent a modification of the Newtonian law of gravitation. To see this 
just replace the r.h.s. of ma = f, with a non-Newtonian gravitational force 
which depends nonlinearly on its sources in such a way that the content of 
this equation coincides that of Eq. (jl.ip . Of course, the resulting theory 
will also flout the conservation of momentum for the same reason as above. 
This defect is easily repaired by deriving the requisite theory from a suitable 
Lagrangian (Bekenstein and Milgrom 1984). 

First one stipulates that a particle's acceleration a is given by — where 

is some potential distinct from Newton's; in this way the motion is con- 
servative. As the Lagrangian for $ one selects the simplest generalization of 
the Newtonian field Lagrangian which is still rotationally and Galilean in- 
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variant, but depends on a scale of acceleration ao which should be identified 
with MOND's: 

Here .F is some positive function and p is the visible matter's mass density. 
This Aquadratic Lagrangian theory (AQUAL) reduces to Newton's for the 
choice F(X 2 ) = X 2 , when it readily leads to Poisson's equation for 
Generically it yields the equation 

V • [/2(|V$|/a )V$] = 4vrGp, (1.4) 
p{X) = dF(X 2 )/d(X 2 ). (1.5) 

The p function here should be identified with Milgrom's p for the following 
reason. 

Comparison of the AQUAL equation, Eq. (|1.4p . with Poisson's for the 
same p allows to determine its first integral, 

/2(|V$|/ao)VS = V$ N + V xh, (1.6) 

where h is some calculable vector field. Now in situations with spherical 
or cylindrical symmetry h must vanish by symmetry, and it is plain to see 
that Eq. (|1.6|) reduces to the MOND equation because a = — V<I>. Thus, at 
least for highly symmetric situations, the MOND equation can be recovered 
exactly; in more realistic geometries it acquires some "corrections" . AQUAL 
has the correct limits if we stipulate that F(X 2 ) — > X 2 for |V$| 3> «o so 
that p is constant (Newtonian regime), and F{X 2 ) — > |X 3//2 for |V3>| -C ao 
so that /x(|V$|/oo) ->■ |V<3?|/a in this deep MOND regime. 

At present AQUAL is the best available embodiment of the MOND idea 
in the nonrelativistic regime. Being Lagrangian based, AQUAL respects the 
energy, momentum and angular momentum conservation laws. It removes 
an ambiguity inherent in the MOND formula: how is a composite object, 
e.g. a star, to move if its constituents (ions) are subject to strong collision 
related accelerations (S> ao), but the composite moves as a whole in a weak 
gravitational field (<C ao), e.g. the far field of the Galaxy? AQUAL makes 
it clear that, regardless of the internal makeup, the composite's center of 
mass moves in the ambient field according to the MOND formula (Beken- 
stein and Milgrom 1984). This is in harmony with the fact that all sorts 
of stars, gas clouds, etc. delineate one and the same rotation curve for a 
galaxy. AQUAL has permitted the calculation of the MOND force between 
two galaxies in a binary from first principles (Milgrom 1986, 1994a). It 
establishes the existence of the conjectured "external field effect", namely, 
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the partial suppression of MOND effects in a system with internal acceler- 
ations weak on scale ao which happens to be immersed in a gravitational 
field strong on scale ao (Bekenstein and Milgrom 1984). This may well ex- 
plain the total absence of missing mass effects in the open galactic clusters, 
systems with weak internal accelerations (Milgrom 1983a). 

Newtonian disk instabilities originally motivated the idea that DM halos 
surround disk galaxies, and by dint of their dominant gravity were thought 
to moderate the dangerous instabilities. Using a mixture of MOND formula 
and AQUAL equation arguments, Milgrom (1989) showed analytically that 
MOND enhances local stability of disks against perturbations as compared 
to the Newtonian situation. And AQUAL has opened the door to numerical 
simulations of stellar systems within the MOND paradigm (Milgrom 1986) 
which help to explore complex phenomena like stability. Thus Brada and 
Milgrom (1999) combined a A-body code with a numerical solver for the 
AQUAL equation to verify that MOND stabilizes galaxy discs locally, and 
to demonstrate that it enhances global stability as well. The degree of 
stabilization saturates in the deep MOND regime so no disk is absolutely 
stable in MOND. This agrees with the fact that a fraction of disk galaxies 
have bars which must have arisen from instabilities. 

More recently the Paris group of Tiret and Combes studied evolution of 
bars in spiral disks using their own AQUAL based A-body code. They 
find that bars actually form more rapidly in MOND than in DM halo New- 
tonian models, but that MOND bars then weaken as compared to those 
in Newtonian models. The long term distribution of bar strengths pre- 
dicted by MOND is in better agreement with the observed distribution than 
that obtained from Newtonian halo models (Tiret and Combes 2007, 2008). 
Another N-body simulator based on AQUAL has been developed by the 
Bologna school of Ciotti, Nipoti and Londrillo, and used in a number of 
investigations. In studying dissipationless collapse according to AQUAL, 
they find that the profiles of the ensuing objects look like those of real ellip- 
tical galaxies, and when the resulting dynamics are interpreted Newtonially, 
the objects seem to contain little DM, as indeed found in real life (Nipoti 
et al. 2007a). In their study of merging galaxies with AQUAL they find 
that merging is slower than that in Newtonian theory (because of the lack 
of DM generated dynamical friction). This may be a problem for MOND 
because of widespread evidence that galaxies merge fast. However, the out- 
come of a merger is indistinguishable from the merger product in Newto- 
nian theory with DM (Nipoti et al. 2007b). In joint work with Binney 
(Nipoti et al. 2008) the Bologna group have investigated dynamical friction 
with the AQUAL based code confirming the conclusion of an earlier analytic 
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study (Ciotti and Binney 2005) that MOND shortens the dynamical friction 
timescale as compared to that of a system with the same phase space distri- 
bution in Newtonian theory with DM. The situation with regard to slowing 
of bars and mergers is satisfactory; however, a persistent problem remains 
with regard to the too brief timescale for digestion of globular clusters in 
the dwarf spheroidal galaxies (Ciotti and Binney 2004, Sanchez-Salcedo et 
al. 2006). 

As could be expected, AQUAL, being nothing but a reformulation of the 
MOND equation, does not resolve the acceleration discrepancy in clusters 
of galaxies. We shall return to this at the end. For a more comprehen- 
sive review of the AQUAL representation of MOND, the reader is referred 
elsewhere (Bekenstein 2006). 

4. TeVe S and other relativistic MOND theories 

Neither the MOND formula nor AQUAL pretend to encompass relativistic 
phenomena. Yet we need to deal with gravitational lenses, gravitational 
waves, and cosmology, all relativistic phenomena connected with DM. How 
to make the MOND idea relativistic? Since AQUAL has had significant 
success, we should start with it. 

The obvious way to make AQUAL into a covariant theory is to let the 
AQUAL potential $ metamorphose into a scalar field (ft whose Lagrangian 
is the covariant version of Eq. (|1.3|) . Einstein's equations of GR are retained 
so that the desired theory will approximate standard gravity theory, and so 
partake of the latter's many successes, e.g. in the solar system. To achieve 
the requisite deviation from exact GR, which after all cannot reduce to 
MOND, one couples matter in the new theory to (ft by the simple technique 
of writing the matter Lagrangian entirely with the new metric g a p = e 2 ^ g a @ , 
where henceforth we set c = 1. This program was carried out (Bekenstein 
and Milgrom 1984) but the resulting relativistic AQUAL turns out to have 
superluminal propagation (of (ft waves). Even more disconcerting is its failure 
to deal with the observational fact that, whatever it is that enhances the pull 
of galaxies and clusters on their components, also significantly enhances their 
gravitational lensing power (Bekenstein 1988). The root of this problem 
is the conformal relation between the two metrics: it is well known that 
conformally related metrics are equivalent insofar as light propagation is 
concerned. 

It took a while to gather together the elements for a viable relativistic 
embodiment of MOND. One of these is a timelike 4- vector field with unit 
norm, U a , first proposed by Sanders (1997). It is needed to break the above 
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mentioned conformal relation, which Sanders replaced by 

~g a p = e~ 2(t) g a p - (e 2 * - e-**)U a U p . (1.7) 

The spacetime described by g a p is related to that of g a p by a stretching in 
the direction of U a = g a ^Up and a compression orthogonal to it. Sanders 
regarded U a as established by the cosmos, and as pointing in the time di- 
rection. With this proviso, the problem with the lensing can be cured. 

However, the prescription that a 4-vector point in the time direction is 
not a covariant one. Instead one should endow U a with covariant dynamics 
of its own. In the tensor- vector scalar theory, or TeVe S (Bekenstein 2004a 
and 2004b, Bekenstein and Sanders 2006), these dynamics are derived from 
a covariant action of the general form {K and K are dimensionless coupling 
constants) 

S v = J [Kg^g^U [aA U [M +K{g^U^) 2 (1.8) 

- 2\{g^U lx U v + l)]{-g) 1 l 2 d i x. (1.9) 

The first term in the integrand is a Maxwell-like one; it takes care of approx- 
imately aligning U a with the 4-velocity of matter in the region in question, 
this in lieu of the noncovariant requirement that U a point in the time di- 
rection. But the "gauge freedom" inherent in this term turns out to cause 
integral lines of U a to form caustics (Contaldi et al. 2008). Thus in later 
work the second term in the integrand is introduced to break the gauge free- 
dom. Finally, the last term is a constraint (A being a Lagrange multiplier 
function) which forces U a to have unit negative norm with respect to metric 
g a p, and so to always be timelike. Zlosnik, et al. (2006) have shown how 
to reformulate TeVe S solely in terms of the metric g a p (which turns out to 
satisfy Einstein-like equations), and the vector field; the <fi is eliminated with 
the help of the above constraint. In this form TeVeS might figuratively be 
described as GR with a DM (the vector field). However, the second form is 
very intricate. We continue to discuss TeVe S in its original form. 

The dynamics of <p in relativistic AQUAL must be modified to obviate the 
superluminal propagation (Bekenstein 2004a). Here we use the notation of 
later work (Sagi and Bekenstein 2008) to write the scalar action as 

s s = J ^{ke 2 h a ^, a 4>, p ) V^gd'x, (1.10) 

where J 7 is a positive function, h a/3 = g a/3 —U a U^ , k is another dimensionless 
coupling constant, and £ is a constant scale of length. Each choice of the 
function F{y) defines a separate TeVeS theory; fi(y) = dJ r (y)/dy functions 
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somewhat like the ji{x) in MOND. We need only consider functions such 
that T > and fi > for either positive or negative arguments. The correct 
MOND behavior will emerge if /i(y) — > 1 for y — > oo and /u(y) ~ Dyjy for 
< y <C 1 with D a dimensionless positive constant. The action of TeVeS 
comprises S v , S s as well as the customary Einstein-Hilbert action for 
and the matter action written in the usual fashion entirely in terms of g a p. 

Consequences of TeVe S with the above sort of T have been investigated 
mostly for the case K = 0. TeVe S reduces to GR in the parameter regime 
k -> 0, K oc k and £ oc k~ 3 ^ 2 (Bekenstein 2004a). As a consequence there 
is a nonrelativistic quasistationary regime in which TeVeS is Newtonian, 
and which serves to describe the overall features of Earth and solar system 
gravity. In contrast to many gravity theories with a scalar sector, TeVe S ev- 
idences no cosmological evolution of the Newtonian "constant" Gn (Beken- 
stein and Sagi 2008). My original choice of F{y) involves a singular passage 
into the cosmological domain; more successful choices have been propounded 
by Zhao and Famaey (2006) and Sanders (2006). 

The post-Newtonian behavior of TeVe S in terms of the celebrated parametrized 
post-Newtonian coefficients has been investigated (Bekenstein 2004a and 
2005, Giannios 2005, Sanders 2006, Tamaki 2008). The j3 and 7 param- 
eters exactly agree with those of GR. The parameters Ci ; C2 , C3 : C4 an d «3 
are all expected to vanish here as in GR because TeVe S is derived from an 
action principle, and is thus a conservative theory. Eva Sagi (unpublished 
Ph. D. work) has shown that the preferred location parameter £ vanishes 
too. There are strong indications that the preferred frame coefficients a% 
and a.1 are non-vanishing, but extant calculations of them are in want of 
consistency. Reliable calculation of these two coefficients is a high priority 
because experimental bounds on them are strong. 

Out of the parameters of TeVe S one can construct the MOND scale of 
acceleration. Omitting a factor close to unity and reinstating c this is 

a ° = ■ (LU) 

The Newtonian regime mentioned above sets in when the naive gravitational 
field is well above Cq. For weaker ones which are not far from spherically 
symmetric, the gravitational field obeys an equation like (jl.4p with ji directly 
related to the TeVe S fi function (Bekenstein 2004a, Bekenstein and Sagi 
2008). Our stipulation of the small argument limit of T leads automatically 
to the extreme MOND limit for fields <C a®. With a suitable form of J- at 
intermediate arguments, TeVe S thus reproduces the essence of AQUAL and 
MOND, and inherits their success in the theater of galaxies. To conclude one 
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should mention that when worked out in detail, the scale ao is found to have 
a rather weak dependence on epoch of the universal expansion (Bekenstein 
and Sagi 2008). 

Sanders (2005) has proposed a variant of TeVe S, a bi-scalar tensor vector 
theory (BSTV) with three free functions and a free parameter. BSTV is a 
more appropriate frame for generating cosmological evolution of ao- Since 
numerically ao ~ cHq, it is often argued that this scale must be determined 
by cosmology, and should thus vary on a Hubble timescale (Milgrom 1983a). 
As mentioned, in TeVeS ao evolves very slowly, but its change is faster in 
BSTV. Discrimination between the two behaviors may be possible with good 
rotation curves of disk galaxies at redshifts z = 2 — 5. Such curves are just 
now coming into range. 

Zlosnik, et al. (2007) have also proposed a variant of TeVeS, a tensor- 
vector theory in which the timelike vector is normalized with respect to g a p. 
The vector's action is taken to be a function F of /C, the quadratic form in 
the derivatives of the vector field from Einstein- Aether theories. The theory 
has four parameters: a length scale and three dimensionless parameters. 
The form of F(JC) can be deduced approximately from the requirement that 
MOND arise in the nonrelativistic quasistatic limit, and from the stipulation 
that a cosmology built on this theory shall have an early inflationary period 
and an accelerated expansion at late times. 

5. Gravitational lenses and cosmology in TeVeS 

How does TeVe S measure up to the task of describing gravitational lensing? 
Because all matter and field actions, apart of those for g a p,4> an( i U a , are 
constructed with the metric g a p , light rays in TeVe S are null geodesies of 
g a p, which is built out oiU a ,4> and g a p. Since the lensing by galaxies and 
clusters of galaxies spans cosmological distances, one must first understand 
a little about TeVe S cosmology. In its isotropic cosmological models the 
U a is pointed precisely in the time direction. Consequently, models with 
baryonic matter content alone tend to be very similar to the corresponding 
GR models because the scalar 0's energy density stays small (Bekenstein 
2004a, Chiu et al. 2006, Zhao, Bacon et al. 2006) while the vector's is nill. 
According to Chiu et al. these models give a reasonable relation between 
redshift and angular distances, and provide just as good a scaffolding for 
the analysis of cosmologically distant gravitational lenses as do GR models. 

As in isotropic cosmological models, so in static situations like the envi- 
ronment of a galaxy, the vector U a is pointed precisely in the time direction. 
To compute the light ray deflection in linearized theory one also needs the 
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scalar field eft and the metric g a s, both to first order in the Newtonian grav- 
itational potential <J>jv. The line element takes the form (Bekenstein 2004a, 
Bekenstein and Sagi 2008) 

ds 2 = -(1 + 2<5>)dt 2 + (1 - 2§){dx 2 + dy 2 + dz 2 ), (1.12) 

where $ = &n+4>; here we again omit the mentioned nearly unity factor. (In 
linearized GR one gets the same form of metric, but with in place of $.) 
Since the same potential $ appears in both terms of this isotropic form of 
the line element, the light ray bending, which leans on both to equal degree, 
measures the same gravitational potential as do nonrelativistic dynamics, 
which are themselves sensitive only to the temporal part of the line element. 
Thus in TeVeS an extragalactic system lenses light, or radio waves, just 
as would GR, were the latter supplemented by DM in the amount and 
with the distribution necessary to reproduce the observed galactic dynamics 
(Bekenstein 2004a). 

In GR <3?jv is all there is, and its Laplacian, as determined from the lens- 
ing observations or from the dynamics, will give the total mass distribution. 
But DM is not visible directly, so its mass distribution (inferred by sub- 
tracting the observed baryonic component) cannot be checked; its status is 
better described with the terminology of physical plausibility. By contrast 
in TeVe S the observationally determined <& is to be broken up into two 
parts: <3? = + <fi. The $jv part is given by Poisson's equation, the <p part 
by a nonlinear AQUAL-type equation (compare Eq. (|1.10p with Eq. (|1.3p ): 
the observed baryonic mass density p is the source of both. Evidently the 
TeVe S scheme is falsifiable — by comparison of the calculated potential with 
that inferred from the lensing — to a larger extent than is the DM paradigm 
for which any discrepancy can be tucked away into the invisible component. 

Some features of TeVe S gravitational lensing by a pointlike mass M are 
worked out by Chiu, et al. (2006). They note that the deflection angle in 
the deep MOND regime [impact parameter b S> bo = (k/4Tr)(GM/ao) 1 ^ 2 ] 
approaches a constant, as might have been expected from naive arguments, 
but is less predictable in the intermediate MOND regime b ~ 6o- This may 
serve as a warning against doing lensing analysis with a mixture of GR 
and MOND concepts (e.g. Mortlock and Turner 2001). In analogy with 
GR, Chiu, et al. work out the lens equation for TeVe S (which controls the 
amplifications of the various images in strong lensing). For two images they 
find that the difference in amplifications is no longer unity, as in GR, and 
may depend on the masses. They also investigate the gravitational time 
delay in TeVe S which is important for interpreting differential time delays 
in doubly imaged variable quasars. 
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The baryon distribution in a galaxy is better represented by the Hernquist 
model than by a point source model. Zhao et al. (2006) employ both to 
compare TeVe S predictions with a large sample of quasars doubly imaged 
by intervening galaxies. The lensing galaxy masses are estimated by com- 
paring observations both with predicted image positions and with predicted 
amplification ratios; the two methods are found to give consistent results, 
themselves well correlated with the luminosities of the galaxies. The corre- 
sponding mass-to-light ratios are found to be in the normal range for stellar 
populations, with some exceptions. This result clashes with the claim by 
Ferreras et al. (2008) that lensing by galaxies from the very same sample can 
only be explained in MOND by including a lot of DM apart from neutrinos. 
But the last authors use a mixture of MOND and GR instead of TeVeS. 

What should the probability distribution by angular separation of the 
two images in a sample of lensed quasars? This important question has 
proved troublesome for the DM paradigm. In TeVeS it has been investi- 
gated by Chen and Zhao (2006) and lately by Chen (2008). Again modeling 
the shapes of the mostly elliptical galaxies with Hernquist profiles, and de- 
scribing their space distribution with the Fontana function, these workers 
compare predictions of both TeVe S for a purely baryonic universe with cos- 
mological constant and of GR with DM and baryons with the CLASS/ JVAS 
quasar survey. After the preliminary work the later paper reports that 
TeVeS comes out on top. All the above is accomplished with spherical 
mass models of the galaxies; a step towards the modeling of asymmetric 
lenses within TeVeS has been taken by Shan, Feix et al. (2008). 

When it comes to weak lensing (distorted but unsplit images) by clus- 
ters of galaxies, a pure MOND account is less than satisfactory. The case 
of spherically symmetric clusters is fairly summarized by Takahashi and 
Chiba (2007). In spherical symmetry the Poisson and AQUAL equations 
of weak field TeVeS for <I>jv and (f> are easily solved, and the total poten- 
tial <I> = + <f> is found to be related to <E>at by a MOND relation with 
some complicated ji function. These authors use several such interpolation 
functions to predict the shear and convergence of the light lensed by a large 
number of quasispherical clusters in terms of the visible baryonic matter, 
but fail to get a fit with observations unless they add a neutrino component 
a la Sanders (2003, 2007); the required neutrino mass is unrealistically large, 
so it seems that a DM component is needed to buttress the MOND effect. 
Similar conclusions are reached by Natarajan and Zhao (2008). 

Nonspherical cluster systems are also problematic. In the massive collid- 
ing clusters systems MACSJ0025.4-1222 (Bradac et al. 2008) and 1E0657- 
56 (Clowe et al. 2004) the galaxy components have been rudely separated 
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from the hot gas concentrations. Weak lensing mapping using background 
galaxies shows the gravitating mass to be preponderately located in the re- 
gions containing the galaxies, rather than in the gas which accounts for the 
bulk of the visible baryonic mass (Clowe et al 2006, Bradac et al 2008). Col- 
lisionless DM would indeed be expected to move together with the galaxies 
and get separated from the collisionless gas; hence the widespread inference 
that much DM exists in these systems. However, this view conflicts with the 
finding (Mahdavi et al. 2007) that in the merging clusters A520 the lensing 
center is in the hot gas which is separate from the galaxy concentration. 
Angus, Famaey et al. (2006) considered it possible to explain the lensing 
seen in 1E0657-56 by TeVe S with a reasonable purely baryonic matter dis- 
tribution, but later concluded (Angus, Shan et al. 2007) that a collisionless 
component is needed after all, with neutrinos just barely supplying a res- 
olution. This conclusion is confirmed by a careful study of Feix, Fedeli et 
al. (2008) who devised a Fourier solver for the AQUAL equation in TeVe S, 
and conclude that the source of gravity in 1E0657-56 must include an in- 
visible component. This study also shows that nonlinearity of the AQUAL 
equation cannot prevent the lensing from tracking the baryonic matter. 

The weak lensing by cluster C10024+17 provides another relevant case 
study. Jee et al. (2007) find its deduced mass surface density to exhibit 
a ring which does not coincide either with the galaxy distribution, or the 
hot gas. Again this has been hailed as graphic proof of DM. But Milgrom 
and Sanders (2008) argue that such feature is actually expected in MOND, 
lying as it does at the transition between the Newtonian and the MOND 
regime. Famaey et al. (2007) conclude that the lensing in C10024+17 can 
be modeled in MOND by including 2 eV neutrinos. A truly TeVeS model 
of C10024+17 is still outstanding. 

Turn now to cosmology. Critics of MOND used to argue that the complex 
power spectrum of cosmological perturbations of the background radiation, 
which is said to be well fit by the "concordance" DM model of the universe, 
proves that DM is essential to any rational picture of the cosmos. They 
tacitly assumed that the MOND paradigm could never measure up to the 
same test. With TeVe S on the scene this could be checked for the first time. 

In a massive work Skordis (2006) has provided the full covariant formalism 
for evolution of cosmological perturbations in TeVeS, the analog of that in 
GR. Using it Skordis et al. (2006) have shown that, without invoking DM, 
TeVeS can largely be made consistent with the observed spectrum of the 
spatial distribution of galaxies, and of the cosmic microwave radiation, if 
one allows for contributions to the energy density of massive neutrinos, and 
of a cosmological constant. The role of DM in GR cosmology is taken over 
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by a feedback mechanism involving the scalar field perturbations. Dodelson 
and Liguori (2006) have independently calculated perturbation growth, and 
claimed that it is rather the vector field in TeVe S which is responsible for 
growth of large scale structure without needing DM for this. Thus although 
the elimination of galaxy bound DM was the original motivation for MOND, 
and thus for TeVeS, the later may potentially provide a way to eliminate 
cosmological (homogeneously distributed) DM ! 

Apart from the DM mystery, cosmology furnishes us with a "dark energy" 
mystery. Dark energy is the agent responsible for the observed acceleration 
of the Hubble expansion in the context of GR cosmological models. An in- 
teresting question is whether modified gravity can supplant the dark energy. 
Here we touch only upon TeVe S related work in this direction. Diaz-Rivera 
et al. (2006) find an exact deSitter solution of TeVe S cosmology which can 
represent either early time inflation epochs or the late time acceleration era. 
Hao and Akhoury (2005) conclude that with a suitable choice of the TeVe S 
function J 7 , the scalar field can play the role of dark energy. According 
to Zhao (2006) the choice of T implicit in the work of Zhao and Famaey 
(2006) leads to cosmological models that evolve at early times like those of 
standard cold DM cosmology, and display late time acceleration with the 
correct present Hubble scale, all this without needing DM or dark energy. 
Likewise, in the related Einstein-Aether theory, Zlosnik et al. (2007) find 
that with suitable choice of their theory's T, the vector field can both drive 
early inflation as well as double for dark energy at late times. 

GR and TeVe S differ, also in the cosmological arena, in the relation they 
stipulate between the matter overdensity and the local depth of the gravi- 
tational potential. The advent of large lensing surveys may open a way to 
distinguish between these two theories, as well as between GR and other 
modified gravities, by exposing correlations between galaxy number density 
and weak lensing shear (Schmidt 2008, Zhang et al. 2007). The effect of 
the dark energy on the expansion can be separated out by comparing cos- 
mological models with the same expansion history in two theories. And the 
ultimate confrontation between GR and TeVe S cosmology may be accom- 
plished by cross-correlating galaxy number density with cosmic microwave 
background antenna temperature (Schmidt et al. 2007). 
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